Query Extension of Retrieve System Using Hangul Word Embedding and Apriori
نویسندگان
چکیده
منابع مشابه
Novel Query Expansion Technique using Apriori Algorithm
One problem in query reformulation process is to nd an optimal set of terms to add to the old query. In our TREC experiments this year, we propose to use the association rule discovery (especially apriori algorithm) to nd good candidate terms to enhance the query. These candidate terms are automatically derived from collection, added to the original query to build a new one. Experiments conduct...
متن کاملEmbedding Word Tokens using a Linear Dynamical System
Low dimensional representations of words allow accurate models to be trained on limited annotated data. While most word representations are context-independent, a natural way to induce representations for words in their particular context is to perform inference over latent variables in a probabilistic model. Given the recent success of continuous vector-space word representations, we provide s...
متن کاملAcronym Disambiguation Using Word Embedding
According to the website AcronymFinder.com which is one of the world's largest and most comprehensive dictionaries of acronyms, an average of 37 new human-edited acronym definitions are added every day. There are 379,918 acronyms with 4,766,899 definitions on that site up to now, and each acronym has 12.5 definitions on average. It is a very important research topic to identify what exactly an ...
متن کاملQuery Subtopic Mining Exploiting Word Embedding for Search Result Diversification
Understanding the users’ search intents through mining query subtopic is a challenging task and a prerequisite step for search diversification. This paper proposes mining query subtopic by exploiting the word embedding and short-text similarity measure. We extract candidate subtopic from multiple sources and introduce a new way of ranking based on a new novelty estimation that faithfully repres...
متن کاملSuMACC Project's Corpus - A Topic-Based Query Extension Approach to Retrieve Multimedia Documents
The SuMACC project aims at automatically tracking new multimodal entities on Internet. The goal of the project is to propose robust multimedia methods that define relevant patterns allowing to automatically retrieve these entities. This paper describes the SuMACC corpus collected on video-sharing platforms using word-queries. Since concepts are limited to a single or few words, querying video-s...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: The Journal of Advanced Navigation Technology
سال: 2016
ISSN: 1226-9026
DOI: 10.12673/jant.2016.20.6.617